🤖 feat: add deterministic stream guardrails for verification and doom loops by ibetitsmike · Pull Request #2476 · coder/mux

ibetitsmike · 2026-02-18T00:09:03Z

Summary

Adds two deterministic harness guardrails to the agent loop that enforce better agent behavior via the tool pipeline (not just prompting):

Pre-completion verification guard — gates agent_report to reject completion when the agent edited files but never ran any validation commands (tests, typecheck, lint). Allows through on a second attempt as an escape hatch.
Doom-loop detection — tracks per-file edit counts during a stream and injects a model-only nudge when the same file is edited 7+ times, telling the agent to step back and reconsider its approach.

Implementation

New per-stream tracker classes

StreamEditTracker — counts edits per file path, supports one-time nudge per file per stream
StreamVerificationTracker — tracks whether any validation-like bash commands were run, with a one-time nudge-then-allow-through lifecycle

Both are instantiated per-stream in aiService.ts and threaded through ToolConfiguration to tool factories.

Verification guard (`agent_report`)

Before returning { success: true }, checks if edits occurred and no validation was attempted
First attempt: throws with a clear error instructing the agent to run validation
Second attempt: allows through (escape hatch for tasks where validation isn't applicable)
Detection uses regex patterns matching common validation commands (make test, bun test, vitest, tsc, run_and_report, etc.)

Doom-loop nudge (`file_edit_operation`)

After each successful file write, records the edit in the tracker
At threshold (7 edits to same file), attaches a <notification> via __mux_notifications (model-only, stripped before UI/persistence)
Nudges once per file per stream (no spam)
Skipped in plan-only mode

Safety

All behavior opt-in by tracker presence in ToolConfiguration — IPC tool calls without trackers see zero change
Uses existing __mux_notifications infrastructure (already tested for stripping before persistence/UI)
Conservative defaults: threshold 7, one nudge, one verification block then escape hatch

Validation

All new unit tests pass (StreamEditTracker, StreamVerificationTracker, agent_report, bash, file_edit_operation)
make typecheck ✅
make lint ✅
make fmt-check ✅

Generated with mux • Model: anthropic:claude-opus-4-6 • Thinking: xhigh • Cost: $1.44

ibetitsmike · 2026-02-18T00:09:09Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 29df8afc44

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T00:15:12Z

@codex review

Addressed feedback: tightened run_and_report detection to only match when the wrapped command itself is a validation command. Added negative test case for run_and_report install bun install.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e99336f295

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/file_edit_operation.ts

ibetitsmike · 2026-02-18T00:22:42Z

@codex review

Addressed: new-file creation in file_edit_insert.ts now records edits via editTracker.recordEdit() in the create-file branch, ensuring both guardrails cover file creation workflows.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a503345873

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T00:28:48Z

@codex review

Addressed: validation commands are now recognized after shell operators (&&, ||, ;, |), covering monorepo workflows like cd packages/app && make test and run_and_report unit cd packages/app && bun test.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ece687522d

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T00:35:13Z

@codex review

Addressed: removed .*? from run_and_report regex so only the actual command (third word) is checked. Added negative test for run_and_report note echo make test. Chained commands like run_and_report unit cd app && bun test still work because && bun test is caught by the standalone pattern.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e27c3a7853

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T00:41:38Z

@codex review

Addressed with a clarifying code comment. Shell command parsing with regex is inherently imprecise — environment prefixes (env CI=1), shell wrappers (bash -c), etc., create an infinite regression of edge cases that regex alone can't solve. The agent_report escape hatch (second attempt always passes) is the designed safety net for any false negatives, adding at most one harmless retry. The current heuristic correctly handles the common cases (direct commands, &&/; chains, and run_and_report with direct validation commands).

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 16a071c69a

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/agent_report.ts

ibetitsmike · 2026-02-18T00:47:14Z

Pausing the review loop here and leaving this for human direction.

Re: bash-based edit tracking — This was deliberately scoped out in the original plan. Tracking file edits via bash would require parsing arbitrary shell commands to figure out which files they write to (redirections, sed, git, formatters, etc.), which is fundamentally unreliable and a much larger, separate problem.

The file_edit_* tools are the primary editing mechanism for agents — the vast majority of code changes go through them. Covering those handles the common case. Bash-based edits are an inherent limitation that's acknowledged and can be addressed as a follow-up if data shows it's needed.

The alternative (heuristically guessing which files bash commands modify) would introduce a high false-positive rate and make the guardrail unreliable in the other direction.

…loops

…mand run_and_report is a generic wrapper, not inherently a validation command. Only match when the wrapped command itself is a validation command (e.g., run_and_report typecheck make typecheck).

The create-file branch in file_edit_insert.ts bypassed executeFileEditOperation, so new files weren't counted by the edit tracker. This meant a stream that only created files could skip verification.

Match validation commands like 'cd packages/app && make test' and 'source .env; bun test' by accepting shell operators as command prefixes in addition to line start.

Remove .*? from the run_and_report pattern so that only the third word (the actual command) is checked against validation patterns. Chained commands after && are already caught by the standalone pattern.

Shell command parsing with regex is inherently imperfect. The escape hatch (second agent_report always passes) covers false negatives from env prefixes and shell wrappers.

ibetitsmike · 2026-02-18T08:58:18Z

@codex review

ibetitsmike · 2026-02-18T09:00:03Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b5757d4d7e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/agent_report.ts

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T09:07:15Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b5757d4d7e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/agent_report.ts

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T09:18:07Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 95d45718c2

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

src/node/services/tools/agent_report.ts

Background bash commands (run_in_background=true) haven't produced results yet when they start, so they shouldn't count as 'validation attempted' for the pre-completion verification guard.

ibetitsmike · 2026-02-18T09:25:36Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 07e9e67fcd

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/agent_report.ts

src/node/services/tools/bash.ts

…n't count Previously, running make test before editing files would permanently satisfy the verification guard. Now file edits reset the validation tracker, ensuring only post-edit validation counts.

ibetitsmike · 2026-02-18T09:34:09Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f8b64a38ff

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

ibetitsmike · 2026-02-18T09:41:57Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f8b64a38ff

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

src/node/services/tools/agent_report.ts

ibetitsmike · 2026-02-18T09:47:47Z

Note for reviewers

Codex keeps raising the same P1/P2 about tracking bash-based file edits (redirections, sed -i, git commands, etc.) on every review round — this has been raised and resolved 6+ times across review rounds.

This is an intentional design decision:

Reliably detecting file mutations from arbitrary bash scripts is infeasible with static regex
The escape hatch (second agent_report call always passes) handles this case
The guardrail catches the common case (tool-based file_edit_* edits) deterministically
Conservative defaults: one nudge, one block, then allow through

All other Codex feedback was addressed with code changes:

✅ Narrowed run_and_report regex to check actual wrapped command
✅ Tracked new-file creation in file_edit_insert
✅ Recognized validation commands after shell operators (&&, ;, |)
✅ Excluded background commands from verification credit
✅ Reset validation state on file edits (pre-edit validation no longer counts)

CI checks are all passing. Ready for human review.

ibetitsmike · 2026-02-18T09:47:52Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f8b64a38ff

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/node/services/tools/bash.ts

src/node/services/tools/agent_report.ts

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/file_edit_operation.ts Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/agent_report.ts Show resolved Hide resolved

ibetitsmike added 6 commits February 18, 2026 08:57

🤖 fix: add deterministic stream guardrails for verification and doom …

a23c173

…loops

fix: tighten run_and_report validation detection to check wrapped com…

60ec04a

…mand run_and_report is a generic wrapper, not inherently a validation command. Only match when the wrapped command itself is a validation command (e.g., run_and_report typecheck make typecheck).

fix: track new-file creation in edit tracker for verification guard

abb5fb5

The create-file branch in file_edit_insert.ts bypassed executeFileEditOperation, so new files weren't counted by the edit tracker. This meant a stream that only created files could skip verification.

fix: recognize validation commands after shell operators (&&, ;, |)

6f07a38

Match validation commands like 'cd packages/app && make test' and 'source .env; bun test' by accepting shell operators as command prefixes in addition to line start.

fix: narrow run_and_report regex to match only the actual command

46f3848

Remove .*? from the run_and_report pattern so that only the third word (the actual command) is checked against validation patterns. Chained commands after && are already caught by the standalone pattern.

docs: add comment explaining validation heuristic limitations

b5757d4

Shell command parsing with regex is inherently imperfect. The escape hatch (second agent_report always passes) covers false negatives from env prefixes and shell wrappers.

ibetitsmike force-pushed the mike/harness-guardrails branch from 16a071c to b5757d4 Compare February 18, 2026 08:58

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/agent_report.ts Show resolved Hide resolved

src/node/services/tools/bash.ts Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/agent_report.ts Show resolved Hide resolved

src/node/services/tools/bash.ts Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Show resolved Hide resolved

src/node/services/tools/agent_report.ts Show resolved Hide resolved

fix: exclude background commands from verification credit

07e9e67

Background bash commands (run_in_background=true) haven't produced results yet when they start, so they shouldn't count as 'validation attempted' for the pre-completion verification guard.

ibetitsmike force-pushed the mike/harness-guardrails branch from 95d4571 to 07e9e67 Compare February 18, 2026 09:25

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/agent_report.ts Show resolved Hide resolved

src/node/services/tools/bash.ts Show resolved Hide resolved

fix: reset validation state on file edits so pre-edit validation does…

f8b64a3

…n't count Previously, running make test before editing files would permanently satisfy the verification guard. Now file edits reset the validation tracker, ensuring only post-edit validation counts.

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Show resolved Hide resolved

src/node/services/tools/agent_report.ts Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Feb 18, 2026

View reviewed changes

src/node/services/tools/bash.ts Show resolved Hide resolved

src/node/services/tools/agent_report.ts Show resolved Hide resolved

Comments

Conversation

ibetitsmike commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Implementation

New per-stream tracker classes

Verification guard (agent_report)

Doom-loop nudge (file_edit_operation)

Safety

Validation

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

ibetitsmike commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

ibetitsmike commented Feb 18, 2026 •

edited

Loading

Verification guard (`agent_report`)

Doom-loop nudge (`file_edit_operation`)